ReNew: A Semi-Supervised Framework for Generating Domain-Specific Lexicons and Sentiment Analysis
نویسندگان
چکیده
The sentiment captured in opinionated text provides interesting and valuable information for social media services. However, due to the complexity and diversity of linguistic representations, it is challenging to build a framework that accurately extracts such sentiment. We propose a semi-supervised framework for generating a domain-specific sentiment lexicon and inferring sentiments at the segment level. Our framework can greatly reduce the human effort for building a domainspecific sentiment lexicon with high quality. Specifically, in our evaluation, working with just 20 manually labeled reviews, it generates a domain-specific sentiment lexicon that yields weighted average FMeasure gains of 3%. Our sentiment classification model achieves approximately 1% greater accuracy than a state-of-the-art approach based on elementary discourse units.
منابع مشابه
Creating Domain-Specific Sentiment Lexicons via Text Mining
Sentiment analysis aims to identify and categorize customer’s opinion and judgments using either traditional supervised learning techniques or unsupervised approaches. Traditionally, Sentiment Analysis is performed using machine learning techniques such as a naive Bayes classification or support vector machines (SVM), or could make use of a sentiment lexicon, that is, a list of words that are m...
متن کاملLCCT: A Semi-supervised Model for Sentiment Classification
Analyzing public opinions towards products, services and social events is an important but challenging task. An accurate sentiment analyzer should take both lexicon-level information and corpus-level information into account. It also needs to exploit the domainspecific knowledge and utilize the common knowledge shared across domains. In addition, we want the algorithm being able to deal with mi...
متن کاملComprehensive Study on Lexicon-based Ensemble Classification Sentiment Analysis
We propose a novel method for counting sentiment orientation that outperforms supervised learning approaches in time and memory complexity and is not statistically significantly different from them in accuracy. Our method consists of a novel approach to generating unigram, bigram and trigram lexicons. The proposed method, called frequentiment, is based on calculating the frequency of features (...
متن کاملCross-Domain Co-Extraction of Sentiment and Topic Lexicons
Extracting sentiment and topic lexicons is important for opinion mining. Previous works have showed that supervised learning methods are superior for this task. However, the performance of supervised methods highly relies on manually labeled training data. In this paper, we propose a domain adaptation framework for sentimentand topiclexicon co-extraction in a domain of interest where we do not ...
متن کاملDebunking Sentiment Lexicons: A Case of Domain-Specific Sentiment Classification for Croatian
Sentiment lexicons are widely used as an intuitive and inexpensive way of tackling sentiment classification, often within a simple lexicon word-counting approach or as part of a supervised model. However, it is an open question whether these approaches can compete with supervised models that use only word-representation features. We address this question in the context of domain-specific sentim...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2014